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The genome organization of porcine respiratory coron- 
avirus (PRCV), a newly recognized agent which has a 
close antigenic relationship to the enteropathogenic 
transmissible gastroenteritis virus (TGEV), was 
studied. Genomic RNA from cell-cultured PRCV 
(French isolate RM4) was used to produce cDNA 
clones covering the genomic 3’ end to the start of the 
spike (S) glycoprotein gene (7519 nucleotides). Six 
open reading frames (ORFs) were identified that 
allowed the translation of three coronavirus structural 
proteins and three putative non-structural (NS) poly- 
peptides, homologous to TGEV ORFs designated 


Introduction 


A new porcine respiratory coronavirus, tentatively 
designated PRCV, which has a close antigenic relation- 
ship to the enteropathogenic transmissible gastroenteri- 
tis virus (TGEV), suddenly emerged in 1983 to 1984 and 
spread within less than 2 years in most if not all European 
countries, where is now persists enzootically (Pensaert et 
al., 1986; Brown & Cartwright, 1986; Jestin et al., 1987; 
Have, 1990). A cytopathic agent has been isolated in cell 
culture and pathogenesis studies have shown that it 
replicated at a high titre in the respiratory tract, but toa 
very low extent in the gut (Pensaert et al., 1986). The 
significance of this virus as a pathogen is still unclear. 
Although it was first considered as non-pathogenic 
(Pensaert et al., 1986), subsequent investigations have 
linked PRCV with field outbreaks of respiratory disease 
(Jestin et al., 1987) and with pneumonia lesions following 
. an experimental infection (Duret et al., 1988; O’Toole et 
_al., 1989; Van Nieuwstadt & Pol, 1989). Conventional 
serological tests do not distinguish between PRCV- and 
TGEV-infected animals. However, antigenic dissimilar- 
ities were evident on examining the heterologous 
reactivity of monoclonal antibodies (MAbs). PRCV was 
found to be non-reactive towards several non-neutraliz- 
ing anti-TGEV MAbs directed against the S (spike) or M 
(membrane) proteins (Callebaut er al., 1988; Garwes et 
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NS3-1, NS4 and NS7. Pairwise alignment of PRCV 
nucleotide and amino acid sequences with sequence 
data available for three TGEV strains revealed a 96% 
overall homology. However, the genome of PRCV 
exhibited two important distinctive features. The first 
was that the S gene lacked 672 nucleotides in the 5’ 
region and encoded a truncated form of the S 
polypeptide, and secondly, the first NS ORF down- 
stream of the S gene was predicted to be non-functional 
as a consequence of a double deletion. The significance 
of genomic deletions with respect to tissue tropism and 
evolution of coronaviruses is discussed. 


al., 1988); neutalization-mediating epitopes unique to 
TGEV have been identified also (Laude ef al., 1988). 
Conversely, anti-PRCV S-specific MAbs have been 
shown to differentiate between the two viruses (O. 
Deriabine, R. L’Haridon, J. Gelfi & H. Laude, 
unpublished data). Examination of the structural poly- 
peptide pattern of PRCV virions (TLM83 strain) shown 
by immunoblotting did not reveal any significant size 
discrepancy as compared to the TGEV Purdue strain 
(Callebaut et al., 1988). 

The molecular characterization of this newly recog- 
nized agent has been undertaken in the laboratory with 
the aim of gaining information with respect to two 
important questions: what is the phylogenic origin of 
PRCV?, and which gene(s) control(s) the expression of 
the enteric tropism of TGEV? In the present study part 
of the genetic information of PRCV was determined and 
compared to available TGEV sequence data. We showed 
that although the known part of their genomes are nearly 
identical, one major difference exists in both a structural 
and a non-structural gene. 


Methods 


Virus and cells. The French isolate RM4 of PRCV at four passages in 
the swine testicle (ST) cell line (Duret e7 a/., 1988), supplied by Rhéne- 
Mérieux Laboratoire IFFA, was used as a virus source. The virus was 
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adapted to the pig kidney (PDS) cell line (three passages), and then 
plaque-purified. Infected cultures were maintained in Dulbecco’s 
MEM plus 5% foetal calf serum and incubated for 40 h at 38°C. 
Infectivity titres measured in ST cells averaged 10° p.f.u./ml. 


RNA extraction. Clarified supernatant from the infected PDS cell 
culture medium was used as a starting material. The virions were semi- 
purified by ultracentrifugation through a 10 ml 25% glycerol cushion 
(Beckman rotor SW28, 60 min at 25000 r.p.m.). Pellets were treated 
with proteinase K plus 2% SDS and RNA was phenol-extracted as 
described (Rasschaert et a/., 1987). RNA material was precipitated 
overnight by 2 M-LiCl at — 20 °C then washed twice with 80% ethanol 
and resuspended in double-distilled water. 


cDNA cloning. The strategy and protocols were essentially as 
reported (Rasschaert er al., 1987). Briefly, purified RNA was copied by 
reverse transcriptase (Moloney murine leukaemia .virus, Bethesda 
Research Laboratories) using the TGEV-specific reverse primer pE2, 
located 47 nucleotides downstream of the S gene (5° CATCATCCT- 
TAACAAAATTCTCTAGCAGAA 3’). RNase T2-treated cDNA- 
RNA hybrids were dC-tailed and inserted into PstI-cut dG-tailed 
pBR322. Transfection of Escherichia coli RR cells and screening of 
clones by colony hybridization with TGEV cDNA clones pTG2.18 
and pTG2. 26 (Rasschaert et al., 1987), were performed using standard 
methods (Sambrook e7 al., 1989). Two additional clones (designated 
pTVR) were obtained by insertion of PCR-amplified (see below) 
cDNA fragments into the Smal site of the pTZ vector (Pharmacia) and 
transformation of E. coli NMS522 cells. 


Polymerase chain reaction (PCR). PCR-amplified fragments were 
obtained using cDNA-RNA heteroduplexes as templates and 
20-mer oligonucleotides derived from the PRCV sequence. Taq DNA 
polymerase (Stratagene, 2 units) was employed in the conditions 
described by Saiki et a/. (1988). Thirteen cycles (90 °C, 55 °C and 70 °C 
for 1 min each) were performed in a Hybaid apparatus (Ceralabo). 


Supercoiled DNA sequencing. DNA matrices were prepared accord- 
ing to Lim & Péne (1988). Three pg of plasmid DNA was denatured by 
0:15 M-NaOH plus 0:15 M-EDTA for 30 min at 37 °C, then precipitated 
overnight with ethanol and 0-2 M-ammonium acetate. Sequence 
reactions were performed using the Sequenase kit (USB) and synthetic 
primers (Biosearch 8600 apparatus). 


Analysis of viral polypeptides. Radiolabelled cell extracts were 
prepared as described (Laude er al., 1986) with the following 
modifications. ST cell monolayers were infected or mock-infected by 
PRCV at a multiplicity of 1 p.f.u. per cell and maintained in Eagle’s 
MEM plus 2% foetal calf serum. At 3 h post-infection, [?>S]methionine 
was added at 100 uCi/ml. Fifteen h later the cells were scraped into 
RIPA lysis buffer and immunoprecipitation was performed using a 
mixture of three anti- TGEV MAbs: 51.13, 22.6 and 3.60 (Laude et al., 
1986). Resulting samples were analysed on a 9 to 20% polyacrylamide 
gradient SDS gel plus 5% 2-mercaptoethanol. 


Results 


Generation and mapping of the PRCV cDNA clones 


PRCV genomic RNA extracted from semi-purified 
virions was reverse-transcribed after priming with a 
TGEV-specific sequence downstream of the S gene. 
Heteroduplex RNA-DNA material was then introduced 
into the pBR322 vector and cloned into E. coli. The 
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Fig. 1. Restriction and gene maps of PRCV genome 3’ region. The 
positions of the cDNA clones used for sequencing are shown. pTG2. 26 
and pTG2. 18 (dotted lines) are TGEV cDNA clones that were used for 
screening. Enzyme sites: Bg/II (Bg), HindIII (Hi), Hpal (Hp), PstI (Ps), 
Pvull (Pv), Xbal (Xb). 


clones to detect the S or the M and nucleocapsid (N) 
genes, respectively (Rasschaert et al., 1987). The pVR12 
and pVR26 clones arose from non-specific priming (Fig. 
1). This appears to be the case also for the clones covering 
the S gene, as the TGEV reverse primer used (derived 
from Purdue virus sequence) was found to match 
imperfectly with the PRCV sequence, in particular at the 
3’ end. The clones pTVR10 and pT VR25 were obtained 
subsequently through PCR amplification of portions of 
the PRCV genome delimited using appropriate am- 
plimers derived from partial sequencing. Restriction 
analysis of the pVR and pTVR clones permitted the 
construction of a continuous map covering approxi- 
mately 7-5 kb of the PRCV genomic 3’ region. 


DNA sequencing and analysis 


The insert of each of the clones specified in Fig. 1 was 
sequenced directly on plasmid supercoiled DNA and in 
both directions. Translation of the resulting 7519 
nucleotide long sequence allowed the identification of six 
ORFs in the viral sense strand (Fig. 1; nucleotide and 
amino acid sequences displayed in Fig. 2). The largest 
three were predicted to encode the typical coronavirus 
structural polypeptides S, M and N. The other three 
were found to be homologous to TGEV ORFs encoding. 
putative non-structural polypeptides, and were desig- 
nated NS3-1, NS4 and NS7 as discussed later:. The 
repeated sequence 5’ CTAAAC 3’ found upstream of 
each ORF, assumed to act as a signal for the transcrip- 
tion of subgenomic RNA in coronaviruses (review: 
Spaan et al., 1988), is identical to that of TGEV 
(Rasschaert et a/., 1987). The extreme 3’ non-coding 


region comprised an 8 nucleotide sequence conserved in 
coronaviruses, and ended in a poly(A) stretch. 
Pairwise alignment of the PRCV sequence data with 
those available for the three different TGEV strains 
Purdue 115 (Laude et al., 1987; Rasschaert & Laude, 
1987; Rasschaert et al., 1987), FS772/70 (Britton et al., 
1988 a, b, 1989) and Miller (Wesley et al., 1989; Wesley, 
1990) revealed a very high level of homology. Thus, 96% 
overall homology was found between PRCV and TGEV 
Purdue strains at both the nucleotide and amino acid 
level. However, several features unique to the PRCV 
genome were apparent. The most striking are three 
deletions affecting ORFS: (i) a 672 base in-frame 
deletion (position marked 1 in Fig. 2), located 59 bases 
downstream of the initiation codon of the S gene and (ii) 
two deletions which result in the alteration of the ORF 
situated immediately downstream of the S gene in the 
TGEV genome, a deletion of 38 bases (at position 2 in 
Fig. 2) removing both the CTAAAC sequence and the 
ATG initiation codon, and a deletion of 36 bases (at 
position 3) removing part of the body sequence. 
Moreover, two short deletions of two and six bases 
(positions 4 and 5) were seen in the PRCV 3’ non-coding 
region by comparison to the Purdue and FS772/70 strain 
sequences (not yet available for the Miller strain). To 
summarize, the observed differences between PRCV 
and TGEV genomes consisted of deletions and point 
mutations only; no sequence unique to PRCV was found. 


Estimation of M, for PRCV S protein 


The above sequence data predicted an M, of 135K for the 
PRCV S apoprotein instead of the 158K for TGEV S. 
An experiment was designed to compare the relative size 
of PRCV and Purdue virus polypeptides by SDS-PAGE 
(Fig. 3). Both the PRCV S species and its intracellular 
precursor S’ exhibited an increased mobility as compared 
to TGEV species S (220K) and S’ (175K), formerly 
named E2 and E’2 (Laude et al., 1986). An M, of 190K 
was estimated for the PRCV S polypeptide. The two 
viruses showed no significant difference in M, for either 
polypeptide N or M. 


Discussion 


In this study we cloned and analysed the 3’ region of the 
genomic RNA of a French isolate of PRCV, a newly 
recognized porcine pneumotropic coronavirus which is 
antigenically closely related to TGEV but which does not 
induce intestinal lesions. The sequenced region, about 
7500 nucleotides long, stretched from the beginning of 
the S glycoprotein gene up to the 3’ poly(A) tail and 
probably encompasses all the viral genes except for the 


Porcine coronavirus nucleotide sequence 2601 


polymerase gene. Thus a substantial part of the genetic 
information of PRCV was available for comparison to 
TGEV. It appeared that the genomes of the two viruses 
had not only a closely related organization, but exhibited 
also a very low level of nucleotide divergence (4%). 
However, as schematized in Fig. 4, PRCV and TGEV 
differed by two striking features. 

First, the region homologous to the TGEV ORF NS3a 
contained no functional ORF (see Fig. 2 and Results). 
Additional detailed sequence data are needed to ascer- 
tain whether this is a common feature of PRCV isolates. 
However this view is supported by the recent report that 
the British isolate had the same ORF altered (P. Britton, 
et al., IV International Coronavirus Symposium, Cam- 
bridge, July 1989). The NS3a ORF (NS3 for the Miller 
strain) has been shown to be transcribed for three 
different TGEV strains and has the potential to encode a 
non-structural polypeptide which remains to be charac- 
terized (Rasschaert et al., 1987; Britton et al., 1989; 
Wesley et al., 1989). The efficient propagation of PRCV 
both in vitro and in vivo suggests that the expression of 
this gene is not an absolute requirement. Incidentally, 
alteration of an ORF encoding a putative NS polypep- 
tide is not unprecedented in TGEV;; the NS3b ORF from 
the Purdue strain propagated in our laboratory was 
reported to have a mutated first ATG codon and, at least 
in some clones, a 13 base frameshift deletion in the 3’ 
region (Rasschaert et a/., 1987) when compared to the 
sequence from another laboratory (Kapke et al., 1988). 

Secondly, the genome of PRCV was found to encode a 
truncated S protein. Analysis of the S gene nucleotide 
sequence established from four cDNA clones predicted a 
1225 long amino acid precursor polypeptide (see Fig. 2) 
instead of 1447 (Purdue virus; Rasschaert et al., 1987) 
and 1449 amino acids (Miller and FS772/70 strains; 
Wesley, 1990; P. Britton, personal communication). The 
difference involves the omission in the PRCV § 
sequence of a continuous stretch of 224 residues starting 
at position 5 from the N terminus (Fig. 5). The sequence 
unique to TGEV S includes the 180 residue long segment 
where the antigenic site D has been mapped (Delmas et 
al., 1990). This finding is consistent with the observed 
lack of reactivity of Purdue site D-specific MAbs 
towards three different PRCV isolates (Laude et al., 
1988). Further, it strengthens the view that the trunca- 
tion of S protein, directly confirmed for the French 
isolate (Fig. 3), is a typical feature of this virus. Also of 
interest is the observation that one of the four MAbs 
defining site D (78.17) retained a significant reactivity 
towards PRCV, suggesting that the relevant epitope 
involves additional residues outside the TGEV-specific 
domain. Alignment of PRCV and TGEV S sequences 
revealed in addition 33 to 43 positions of non-identity, 
depending upon the PRCV-TGEYV strain pair (Fig. 5). 
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ATGGATTACTAAGGAAGGGTAAGTTGCTCATTAGAAATAATGGTAAGTT: 'TTGGTAACCACTTTATTAACACACCATGAAAAAATTATTTGTGGTCTTGGTTGTAATGCCATT 
: MK K LF vVVOLUvVVM PL 
ee a ee ed 


C4] 
GATTTATGGAGACAAGTTTCQTACTTCCGTAGTTICCAATTGCACTGATCAATGTGCTAGTTATGTGGCTAATGTTTTTACTATACTACCAGGAGGCTTTATACCATCAGATTTTAGTTT 
DywG DK F PTS VV S NET BP Q@ecas ¥ VAN VY F T IT LP EC EF F IPS BDF S F 


TAATAATTGGTTCCTCCTAACTAATAGCTCCACGTTIGGTTAATGGTAAAT TAGT TACCAAACAGCCTCTATTAGT TAATTGCTTATGGCCAGTCCCTAGCTTTGAAGAAGTAGCTTCTAC 
NN WF LLT YS S$ T LVN GK EV TK Q@PLLVY NCL we PVPS FP EEVA S8 T 


ATTITIGTTTTGAAGGTGCTGACTTTGATCAATGTAATGGTGCTGTTT TAAATAACACTGTAGACGTCATTAGGT TTAACCTTAATTT TAC TACAAATGTACAATCAGGTAAGGGTGCTAC 
F C F EB GA DF DQcCN GA VLWNN TVODVIRF NLWNELT INV QS GCGKGAT 


AGTGTTTTCATTGAACACAACGGGTGGTGTCACTCTTGAAATCTCATGT TATAA TGATACAGTGAGTGATTCGAGCTTTTCCAGTTACGGTGAAAT TCCGTTCGGCGTAACTAATGGACC 
v FS LNZIGGVTLEtTtscY¥Y NPWIMrvVW.sSs DOS SF SS Y GETPFPF GV TN G P 


ACGGTACTGTTACGTACTCTATAATGGCACAGCTCTTAAGTATCTAGGAACAT TACCACCTAGTGTCAAGGAGATTGCTATTAGTAAGTGGGGCCATTTTTATATTAATGGTTACAATTT 
R YY ¢C ¥ VL ¥Y MG&otA LK Y LEG T LP PS VK £& £ ATS K W GH F ¥ IN GY NW F 


CTTTAGCACATTTCCTATTGATTGTATATCTTTTAATTTGACTACTGGTGATAGTGACGTCT TCTGGACAATAGCT TACACATCGTACACTGAAGCATTAGTACAAGT TGAAAACACAGC 
F S T F P I Dc¢ctIs F NLratTGcé DPS DV F WTA Y TS Y TEA LV QVEN TA 


TATTACAAATGTGACGTATTGTAATAGTTATGT TAATAACAT TAAATGCTCTCAACT,TACTGCTAATTTGAATAATGGATTTTATCCTGTTTCTTCAAGTGAAGTTGGTTCTGTCAATAA 
IT NWVeTryc¢cN S ¥ V NN IT K C S QLUT AN LN N GCG F ¥ PV S&S S&S S EV GG S V UK 


GAGTGTTGTGTTACTACCTAGCTT TCTGACACATACCATTGTTAACATAACTATTGGTCT TGGTAT GAAGCGTAGT GGT TATGGTCAACCCATAGCCTCAACGCTAAGTAACATTACACT 
- v v LLP Ss F LTH TIVNITT IG LGM K RS GY GQPtTIAS TLS Hyp ys 


ACCAATGCAGGATAACAACAACGATGTGTACTGTGTTCGTTCTGACCAATTTTCAGTTTATGTTCATTCTACTTGCAAAAGTGTTTTATGGGACAATGTTTTTAAGCGAAACTGCACGGA 
P MQ DN NN DV YY C VRS DQ@F S VY¥V HS TC K S VLWODN VF K R NC TT D 


CGTTTTAGATGCCACAGCTGTTATAAAAACTGGTACTTGTCCTTTCTCATTTGATAAAT TGAACAATTACTTAACTTTTAACAAGTTCTGTTTGTCGTTGAGTCCCGTTGGTGCTAATTG 
vb DAT AV IK T G fT € PF S F DK LN N Y LT FN K F CLS LS PV GAN C 


TAAGTTTGATGTAGCTGCCCGTACAAGAACCAATGATCAGGTTGT TAGAAGTTTGTATGTAATATATGAAGAAGGAGACAGCATAGTTGGTGTACCGTCTGACAATAGTGGTTTGCACGA 
K F DV AAR TR TN DQVV RS LY vVIY £€EEGDSsS I ¥GYPS DNS GLH D 


TTTGTCAGTGCTACACCTAGATTCGTGCACAGATTACAATATATATGGTAGAACTGGTGTTGGTATTAT TAGACAAACTAACAGGACGATACTTAGTGGCTTATAT TACACATCACTATC 
LS VL HLD Ss €C T DY NITY GCG RT GVGTI I RQTNRT iI LCs GLY ¥Y T SL S 


TGGTGATTTGTTAGGTTTTACAAATGT TAGTGATGGTGTTATCTACTCTGTAACGCCATGTGATGTTAGCGCACAAGCAGCTATTATTGATGGTACCATAGTTGGGGCTATCACTTCCAT 
G DLULG*FtT?TNeyvys DG vtys vtTPe¢eobpDBVS AQAA ATI odDdDG TI VGA TIT § I 


TAACAGTGAATTGTTAGGTCTAACACATTGGACAACAACACCTAATTTTTATTACTACTCTATATATAATTACACAAAT GATAAGACTCGTGGCACTCCAATTGGCAGTAATGACGTTGA 
NS £OL kG Lb Tf HW T T TPN FY Y ¥ S TY MYT N DK T R GT PIG S$ ND VD 


TTGTGAACCTGTCATAACCTATTCTAACATAGGTGTTTGTAAAAATGGTGCTTTGGTTTTTATTAACGTCACACATTCTGATGGAGACGTGCAACCAAT TAGCACTGGTAACGTCACGAT 
c EPVtIT ¥ S NIG V¢cKN GA LV F I NMYV T HS DG DV QPitIs TG NY TP I 


ACCTACTAACTTTACTATATCCGTGCAAGTCGAATACATTCAGGTTTACACTACACCAGTGTCAATAGACTGTTCAAGATATGTTTGTAATGGCAACCCTAGGTGTAACAAACTGTTAAC 
PT NET IS VQwvVEY I @Q@vY¥Y T TPVS IDC FS R Y VCN GNP RCN K LOLT 


ACAATACGTTTCTGCATGTCAAACTATTGAGCAAGCACTTGCAATGGGTGCCAGACTTGAAAACATGGAAGTTGATTCCATGTTATTTGTTTCTGAAAATGCCCTTAAATTGGCTTCTGT 
Q¥ Vv S AC QT I EBeQA LAM GAR LE NM E VV DS M LF VS E NAL K LA S V 


CGAAGCATTCAATAGTTCAGAAACTTTAGATCCTATTTACAAAGAATGGCCTAATATAGGTGGCTTTTGGCTAGAAGGTCTAAAATACATACT TCCGTCCGATAATAGCAAACGTAAGTA 
EA F NS § E T LDP IY K EWP N IT GGFWOLeE GLK ¥ I LPS DN S K R K Y 


TCGTTCAGCTATAGAGGACTTGCTTTTTTCTAAGGTTGTAACATCTGGTT TAGGTACAGTTGATGAAGATTACAAACGTTGTACAGGTGGTTATGACATAGCTGACTTAGTATGTGCTCA 
R S AIEopDOLtL*F S& K Vvw?tw S G LGTY DED XY KR CT GEY DI AD LVCA Q 


ATACTATAATGGCATTATGGTGCTACCTGGTGTGGCTAATGCTGACAAAATGAC TATGTACACAGCATCCCTCGCAGGTGGTATAACATTAGGTGCACTTGGTGGAGGCGCCGTGGCTAT 
yY ¥ N GI MV LP GG VAN AD K MT M Y TAS LAGGTtTLGaALEEEAVA T 


ACCTTTTGCAGTAGCAGTTCAGGCTAGACTTAATTATGTTGCTCTACAAACTGATGTAT TGAACAAAAACCAGCAGATCCTGGCTAGTGCT TT TAATCAAGCTATTGGTAACATTACACA 
P F AV AV QA RUN ¥ VAL QT DV LN KN QQI tA S AF N QA IG LTT Q 


GTCATTTGGTAAGGTTAATGATGCTATACATCAAACATCACGAGGTCTTACAACTGTTGCTAAAGCATTGGCAAAAGT GCAAGATGTTGTCAACACACAAGGTCAAGCTTTAAGACACCT 
S F G K V N DA I#H®QTS R GLTT VA K ALA KV QOD VY VN TFT OG QA LRH OL 


AACAGTACAATTGCAAAATAATTTCCAAGCCATTAGTAGTTCTATTAGTGACATTTATAATAGGCTTGATGAATTGAGTGCTGATGCACAAGTCGACAGGCTGATCACAGGAAGACTTAC 
T VQ &b QN N F QA Is § S IS DI ¥ NRLUD ELS ADA QV DRL ITT GRL T 


AGCACTTAATGCATTTGTGTCTCAGACTCTAACCAGACAAGCCGAGGTTAGGGC TAG TAGACAACTTGC TAAAGACAAGGT TAATGAATGCGTTAGGTCTCAGTCTCAGAGATTCGGCTT 
A LN AF VS Q T LF R Q@ AE VR AS R QULA K DK VN EC VRS QS QR E GF 


CTGTGGTAATGGTACACATTTGTTTTCACTCGCAAATGCAGCACCAAATGGCATGATCTTCTTTCACACAGTGCTATTACCAACGGCGTATGAAACTGTGACTGCTTGGTCAGGTATTTG 
ec GNG&Tt HLF S$ L AN A A PN GCG M IT F F HT VV Lb ob PT AY E fF VT A WwW SS CTC 


TGCTTTAGATGTTGATCGCACTTTTGGACTTGTCGTTAAAGATGTCCAGTTGACTTTATTTCGTAATCTAGATGACAAGTTCTATTTGACACCCAGAACTATGTATCAGCCTAGAGTGGC 
A LDV ODRTF GLVYVK DV Q@tuT LF R NLD DK F Y LT PR TM ¥Y QP RVA 


AACTAGTTCTGATTTTGTTCAAATTGAAGGGTGCGATGTGCTGTTTGTTAATACAACTGTAAGTGATTTGCCTAGTATTATACCTGAT TATATTGATATTAATCAGACTGTTCAAGACAT 
T S$ S$ DF veitrtIeEeEGeDV LF Vv NTT VS DLP S&S TTP DY IT DI Kk GEV Qodii 


ATTAGAAAATTT TAGACCAAATTGGACTGTACCTGAGCTGACAT TGGACGTT TT TAACGCAACCTATT TAAACCTGACTGGTGAAATTGATGACTTAGAGTT TAGGTCAGAAAAGCTACA 
L EN F R PNW TT V PE LT oD VY F NAT Y L NLT G EIoDoDL E F RS E K LH 


TAACACTACTGTAGAACTTGCCATTCTCATTGACAACAT TAACAATACAGTAGTCAATCTTGAATGGCTTAATAGAAT TGAAACT TATGTAAAATGGCCTTGGTATGTGTGGCTACTAAT 
NIT VELA I UI DN I NNT VV NWN LE Woe NS RT ET ¥ VK Ww PRY VBWOEL I 


AGGCTTAGTAGTAATATTTTGCATACCATTACTGCTATTTTGCTGTTGTAGTACAGGTTGCTGTGGATGCATAGGT TGTTTAGGAAGTTGTTGTCACTCTATATTCAGTAGAAGACAATT 
GL VvVdI F € I PLL LF € € € $ TG € € GCG EeTtGEL Gi c Cc H S IT F S RUR Q@ F 
TGAAAATTATGAACCTAT TGAAAAAGTGCACGTCCATTAAATTTAAAATGTTAATTTTATCTCTGCTATAATATCATTTGITGT' GGATGATGAATAAAGAACTTCAAGI'CAGCAAAT 


EN Y E P I EK V HV EH 


TTACTAATACATCCGTGGACGTTGTACTTGGCGAACTTGATTGTGTATACTTTACTGTAACCCT TAAAGTAGAATTTAAGACTTGTAAATTACTTGTGTGCATAGGTTTTGGTGACATAC 
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Porcine coronavirus nucleotide sequence 


Co ecancnsbin 
TTCTTGCGGGTATTGAAGAAGTA Sa a aa ar aa 1 GATTGGTGGACTTT 4080 
NS3-19 M I GGL 5 


TTICTTAATACTCTGAGTTTTGTAATTGTTAGTAACCATCCTATCGT TAATAACACAGCAAATGTGCATCATATACAACAAGAACGTGTTATAGTACAACAGCACCATGTTGTTAGTGCTA 4200 
F LN T LS FV IVS NH P ITVN NWN T AN VY HHI QQ@ERV I V@eH HV VSA 45 


GAACACAAAACTATTACCCAGAGTTCAGCATCGCTGTACTITTIGTATCTTTTCTAGCTTTGTACCGTAGTACAAACTT TAAGACGTGTGTCGGCATCTTAATGTTTAAGATTTTATCAA 4320 
R TQ N Y Y P EF S$ T AVL EF VS F LAL Y¥Y RS TN F K T CV G&G F BM EF K IL S as 


TGACACTTTTAGGACCTATGCTTATAGCATACGGT TACTACATTGATGGCATTGTTACAACAACTGTCTTATCTTTAAGATTCGCCTACTTAGCATACTTTTGGTATGTTAATAGTAGGT 4440 
MTLLGePMLITIAY GY ¥ IDG IVTT TV OES LR FAY LA Y F WY VN S R 125 


TTGAATTTATTTTATACAATACAACGACACTCATGTTTGTACATGGCAGAGCTGCACCGTTTAAGAGAAGT TCTCACAGCTCTATT TATGTCACATTGTATGGTGGCATAAATTATATGT 4560 
FE F IL ¥Y N T T *T LM F V iH GR AA P F € RS S HS S EF ¥Y VT LY G6 GIN Y M 165 


TTGTGAATGACCTCACGTTGCATTT TGTAGACCCTATGCTTGTAAGCATAGCAATACGTGGCT TAGCTCATGCTGATCTAACTGTAGTTAGAGCAGTTGAACTTCTCAATGGTGATTTTA 46a0 
F VN DLTLdsH *F VDP MLV S IT ATR G LAH A DUT VVRA VY E LL NG DO F 205 


TTTATGTATTTTCACAGGAGCCCGTAGTCGGTGTTTACAATGCAGCCTTTTCTCAGGCGGT IE TAAAGGAAATTGACTTAAAAGAAGAAGAGGGAGACCGTACCTATGACGTTTCCTAGG 4800 


I ¥ v F S&S Q@Q EP VVGWVY¥Y N AA F S Q AVLN EIODL K E EB EG DR Yo ov s 244 

M T F PR 5 
GCATTGACTGTCATAGATGACAACGGAATGGTCATTAGCATCATTTTTTGGTTCCTGTTGATAATTATATTGATATTACTTTCAATAGCATTGCTAAATATAATTAAGCTATGCATGGTG 4920 
A LTV rt DDN GM VI S TIF WF LEI itptiuwbs tatLuUuN +i1éK LCM V 45 


TGTTGCAATTTAGGAAGGACAGTTATTATTGTTCCAGTGCAACATGCTTACGATGCCTATAAGAATTTTATGCGAATTAAAGCATACAACCCTGATGGAGCACTCCTTGTTTGAALTAAA 5040 
c c N LGRTVtItV PVvQ@keA Y¥ DAY K N FM ROI KAY NP DGAL LV 82 


TGAAGATTTTGTTGATATTAGCGTGTGCGATTGCATGCACATGTGGAGAACGCTATTGTGCTATGAAAGACGATACAGGTTTGTCATGTCGCAATGGCACGGCGTCTGATTGTGA $160 
M K I LL I LA cc AIA CC T C G ERY CAM K DDT GCG LS C RNGTAS DCE 39 
ne te NEN 


GTCATGCTTCAACAGAGGCGATCTTATTTGGCTTCTTGCAAACTGGAACTTCAGCTGGTCTATAATATTGATCATTTTTATTACTGTGCTACAATATGGAAGACCTCAATTCAGCTGGTT 5280 
Ss ¢ F NR GDLtIwWwtLA N WN F S WS T Tort tiFrFitrgvtuteyeéerRregqQeFtswe* 79 


CGTGTATGGCATTAAAATGCTTATAATGTGGCTATTATGGCCGATTGTTTTGGCTCTTACGATTTTTAATGCATACTCGGAATACCAAGTGTCCAGGTATGTAATGTTCGGCTTTAGTAT 5400 
VoeY GI K M LI MW LLWP IVLA LT IF NAY S EY Qgv¥S R ¥Y VM F GES I 119 


TGCAGGTGCAATTGTTACATTTGTACTCTGGATTATGTATTTTGTAAGATCCAT TCAGT TGTACAGAAGGACTAAGTCTTGGTGGTCCTTCAACCCTGAAACTAACGCAATTICTTTGCGT 5520 
A GA Iv TT F VL WiIiesM ¥ F V RS I Q@uLY R RT K S WW S F NP ET NATIT UC V 159 


TAGTGCATTAGGAAGAAGCTATGTGCT TCCTCTCGAAGGTGTGCCAACTGGTGTCACTCTAACTTTGCTTTCAGGGAATTTGTACGCTGAAGGGTTCAAAATTGCAGGTGGTATGACCAT $640 
S AL GRSsS ¥ V LP LEGGY PT GY TLTtT EE Ss GN LY AE CG F K TAG GEM TI 199 


CGACAATT TGCCAAAATACGTAATGGT TGCATTACCCAGCAGGACTATTGTTTACACACTTGTTGGCAAGAAGTTGAAAGCAAGTAGTGCGACTGGATGGGCTTACTATGTAAAATCTAA 5760 
DNLP K ¥Y VM VA LP S$ RT ITV ¥ T LV GK K OOK AS SAT GWA Y ¥Y VK S K 239 


AGCTGGTGATTACTCAACAGAGGCAAGAACTGATAATTTGAGTGAGCAAGAAAAATTATTACATATGGTATAACIAAAG!T. TGGCCAACCAGGGACAACGTGTCAGTTGGGGGGA 5880 
AGDYSTEARTDNLSEQEKLLUBRBM VY M AN Q G6 QR V S W G D 262/12 


TGAATCCACCAAAATACGTGGTCGCTCCAATTCCCGTGGTCGGAAGATTAATAACATACCTCTTTCATTCTTCAACCCCATAACCCTCCAGCAAGGTGCAAAATTTTGGAACTCATGTCC 6000 
Es T K IR GR S N S RGR K IN N IP LS F FN PTTL Q@ecGaA K F WN S CP $2 


GAGAGATTT TGTACCCAAAGGAATAGGTAATAGGGATCAACAGATTGGTTATTGGAATAGACAAACTCGCTATCGCATGGTGAAGGGCCAACGTAAAGAGCTTCCTGAAAGGTGGTTCTT 6120 
R DF Vv PK GIGNRo>Dbeegeit GY WN ROT R Y RM VK GQR K EL P ER WFP EF 92 


TTACTACTTAGGCACTGGACC TCATGCAGATGCCAAAT TTAAAGATAAAT TAGATGGAGTTGTCTGGGTTGCCAAGGATGGTGCCATGAACAAACCAACCACGCTTGGTAGTCGTGGTGC 6240 
Y Y LG TGP HA DA K F '*K ODK OLD GYVYVY WY AK DGAM N K PT TLE S RGA 132 


TAATAATGAATCCAAAGCTTTGAAATTCGATGGTAAAGTGCCAGGCGAATTTCAACTTGAAGT TAACCAGTCTAGGGACAACTCAAGGTCACGCTCTCAATCTAGATCGCGGTCTAGAAA 6360 
N N E S K A L K F DG K VP G EF QL EVN QS RDN S$ RS RS QS RS RS RN 172 


CAGATCTCAATCTAGAGGTAGGCAACAATCCAATAACAAGAAGGATGACAGTGTAGAACAAGCTGTTCTTGCCGCACTTAAAAAGTTAGGTGTT TACACAGAAAAACAACAGCAACGCTC 6480 
R S& @ S$ FR GR Q@QQS NN K KODODS VEQAVOLAA LK K LGV Y TEK QQAQR SES 212 


TCGTTCTAAATCTAAAGAACGTAGTAACTC TAAAACAAGAGATAC TACGCCTAAGAATGAAAACAAACACACCTGGAAGAGAACTGCAGGTAAAGGTGATGTGACAAGATTT TATGGAGC 6600 
R S K S K ER S N S K T R DT T P K N EN K H T WK RTA GK GODVTR EF Y GA 252 


TAGAAGCAGCTCAGCCAATT TTGGTGACAGTGACCTCGTTGCCAATGGGAGCAGTGCCAAGCAT TACCCACAAT TGGCTGAATGTGTTCCATCTGTGTCTAGCATTTTGTTTGGAAGCTA 6720 
R S S S AN F GDS DULvAN GS S A K H ¥ P QUA EC VPS VS S ILFGS Y 292 


TTGGACTTCABAGGAAGAT GGCGACCAGATAGAAGTCACGT TCACACACAAATACCACT TGCCAAAGGATCATCCTAAAACTGAACAATTCCT TCAGCAGATTAATGCCTATGCTAGCCC 6840 
wots K EDG Dg@tEeE Vv tT F TT H K Y *H LP K ODBPK TE QF LQQINA YA S P 332 


ATCAGAATTGGCAAAAGAACAGAGAAAAAGAAAGTCTCGTTCTAAATCTGCAGAAAGGT CAGAGCAAGAGGTGGTACCTGATTCATTAATAGAAAACTATACAGATGTGTTTGATGACAC 6960 
S ELA K E Q@ R K R K S R S$ K S A ERS EQEVVPODS LIEN ¥Y T DV FODOD T 372 


ACAGGTTGAGATGATTGACGAGGTAACGA AT GCTCGTCCTCCTCCAGGCTGTATGTATTACAGTTTTAACCTTACTACTAATTGGTAGACTCCAATTATTAGAAAGATTA 7080 
QvEM IDEVTN MLvVOLLQAV¢eCt.IT VL TLE LI G€ RL Qu L E R UL 382/27 


TIACTTAATCACTCTTTCAATCTTAAAACTGTTGATGATTTTAATATCTIATATAGGAGTTTAGCAGAAAC TAGATTAC TAAAAGTGGTGCTTICGATTAATCTTTTTAGTCTTATTAGGA 7208 
L &G N AS F N LK T V DD F N TL y RS LAE T REL K VV LDR GIF LV Lo 6G 67 


saad eka Pal en hee sen eB oe beads a Axess re cele 7320 
Fecex-,r Rk &LbLVt.I-LM 78 


ATAGGAGGTACAAGCAACCCTATTGCATATTAGGAAGTTTAGATTT Eo erence TTAAAGATCCGCTACGACGAGCCAACAATGGAAGAGCTAACGT 7460 


CTGGATCTAGTGATTGTTTAAAATGTAAAATTGTTTGAAAATTTTCCTTTTGATAGTGATACACAAAAAAAAAAAAAAA 7519 


Fig. 2. Sequence of the extreme 3’ 7519 nucleotides of the PRCV genome and deduced amino acid sequences of the major ORFs. The 
positions of the deletions relative to three TGEV strains (see Results) are indicated by the framed numbers | to 5 above the nucleotide 
sequence. The framed letters A and B show the positions of two sequence segments unique to the Purdue strain (see Discussion). The 
consensus intergenic sequences are boxed. The octanucleotide sequence conserved in the 3’ non-coding region of coronaviruses is 
overlined. The predicted ORFs are translated into the one-letter amino acid code. The 29 potential Asn-linked glycosylation sites found 
in protein S are underlined. Protein S and M signal peptides are indicated by bold lines. 
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Fig. 3. Comparative gel electrophoresis of PRCV and TGEV 
intracellular polypeptides. Cytosols from [35S]methionine-labelled 
infected ST cells were subjected to immunoprecipitation using a 
mixture of MAbs against Purdue strain (TGEV) structural polypep- 
tides. Immunoprecipitated material was resolved in a 9 to 20% 
acrylamide gradient SDS-PAGE gel. Lane 1, mock-infected cells; lane 
2, PRCV-infected cells (10° c.p.m.); lane 3, TGEV-infected cells 
(104 c.p.m.). M, of marker proteins are indicated in the left margin. 


In particular, one change, Ser to Phe, was present within 
the nine residue stretch expressing C epitopes (Delmas et 
al., 1990); this correlates with the failure of two site C- 
specific MAbs to recognize the PRCV S antigen (Laude 
et al., 1988). By contrast to site D- and C-specific MAbs, 
all site A- and B-specific MAbs exhibited a strong cross- 
reactivity towards PRCV S, despite the existence of 12 
amino acid changes in the region stretching from 
positions 300 to 512, where the relevant epitopes have 
been mapped (Delmas et al., 1990). This further supports 
our earlier assumption that biologically crucial determi- 
nants are expressed in this domain of the S molecule 
(Delmas ef al., 1986). 

The nature of the events responsible for the genomic 
diversity between PRCV and TGEV remains an open 
question. Both homologous and heterologous RNA 
recombinations are proposed to be a driving force for 
coronavirus evolution (see review by Spaan et al., 1988). 
Thus the possibility cannot be formally excluded that the 
sequences unique to TGEV result from insertional 
events. However, we believe that the idea that PRCV is 
derived from TGEV through successive deletion events 


Sy NS3-1 NS4 M N NS7 
PRCV- s MM NAA SSS 
: s NS3_NS3-1 NS4 
Miller VME... WZ. 


Purdue S NS3a NS3bNS4_M N NS? 
Eran LN ZIM SSN 


Fig. 4. Compared gene organization of PRCV and of three TGEV 
strains. The genes are designated according to the latest recommenda- 
tions of the coronavirus study group (Cavanagh er al., 1990). In 
particular, the second NS ORF downstream of the S gene is referred to 
as NS3-1 in the case of the Miller strain to acknowledge the fact that its 
transcription involves a distinct subgenomic RNA species (Wesley et 
al., 1989), in contrast to NS3b ORFs of Purdue and FS772/70 viruses. 
The PRCV ORF named NS3-1 is preceded by an intergenic sequence 
homologous to that of the Miller ORF NS3-1. The TGEV ORFs NS3a, 
NS3b, NS4 and NS7 were formerly named X2a, X2b, X1 and X3, 
respectively (Rasschaert et al., 1987). The diagram was drawn 
according to the sequence data cited in the Results section. 


is the most plausible. Recent data indicate that deletions 
might be responsible for the observed polymorphism of 
the S protein of mouse hepatitis virus (MHV). In 
particular, the S genes of three selected MAb-resistant 
variants of MHV type 4 have been shown to encode 
truncated S proteins with a large deletion of 142 to 159 
amino acids mapping approximately in the N-terminal 
two-thirds of the S1 subunit of S protein (Parker et al., 
1989). Besides, the S protein genes of two MHV strains, 
JHM (Schmidt et al., 1987) and A59 (Luytjes et al., 1987), 
harbour a sequence deleted in the same region relative to 
that of MHV-4. Examination of the TGEV nucleotide 
sequence did not reveal any feature indicative of a 
jumping of the RNA polymerase complex such as a site 
of strong secondary structure, or repeated sequences in 
the vicinity of the deletion site. The only noticeable 
feature was the presence of a heptanucleotide sequence 
AGTTTCC immediately adjacent to the unique se- 
quence of the TGEV S gene which is repeated 10 
nucleotides downstream ; however, no simple mechanism 
underlying a possible deletion process could be recon- 
structed on this basis. 

Another conclusion drawn from the comparison of 
PRCV and of TGEV strains sequences was a closer 
relationship of PRCV to the fully virulent Miller and 
FS772/70 strains than to the high passage Purdue strain, 
of which several sequence features were not seen in 
PRCV: (i) a six nucleotide deletion in the S gene, 
encoding the residues Asn and Asp at positions 151 and 
152 (asterisks in Fig. 5) and (ii) two short insertions of 16 
and 29 nucleotides located upstream and at the 3’ end of 
the NS3a ORF, respectively (corresponding positions in 
PRCV sequence noted A and B in Fig. 2). In addition, 
the PRCV ORF NS3-1 is immediately preceded by a 
sequence CTAAAC, as in the Miller strain (Wesley et 
al., 1989). Finally, pairwise comparisons of the different 
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224 aa unique to TGEV 


N T TQ s 
MKKLF VVLVVMPLIYGDKFPTSVVSNCTDOCASYVANVE TILPGGF IPSDFSENNWFLLTNSSTLVNGKLVTKOPLLVNC 80 
aE ES BD 
N ui TQ s 
A L G P 
LWP VPSFEEVAS TFCFEGADFDQCNGAVLNNTVDVIRFNLNFETTNVOSGKGATVFSLNTTGGVTILEISCYNDTVSDSSFS 160 
A G =e. FE 
D 
SYGE IPFGVTINGPRYCYVLYNGTALKYLGTLPPSVKE ITAISKWGHE Y INGYNFFSTFP IDCISFNLTTGDSDVEWTIAYT 240 
D H 
L 
SYTEALVOVENTAITNVTYCNS YVNNIKCSOLTANLNNGF YP VSSSEVGSVNKSVVLLPB} 320 
K H I 
400 
480 
- Vv 
TPE DVSAQAAI IDGTIVGAITSINSELLGLTHWTTTPNFYYYSIYNYTND 560 
Vv 
R AD 
KTRGTP IGSNDVDCEPVITYSNIGVCKNGALVF INVTHSDGDVOP ISTGNVTIPTNFTISVQVEYIQVYTTPVSIDCSRY 640 
R AD B 
G s 
VCNGNPRCNKLLTOYVSACOT IEQALAMGARLENMEVDSMLF VSENALKLASVEAFNSSETLDP I YKEWPNIGGFWLEGL 720 
s 
A 
KYILPSDNSKRKYRSAIEDLLFSKVVTSGLGTVDEDYKRCTGGYD IADLVCAQY YNG IMVLPGVANADKMTMYTASLAGG 800 
H D 
A 
ITLGALGGGAVAIPFAVAVOQARLNYVALOTDVLNKNOQOILASAFNQAIGNI TOSFGKVNDAIHOTSRGLTTVAKALAKVQ 880 
s 
DVVNTOQGQALRHLTVOLONNFOQAISSSISDIYNRLDELSADAQVDRLITGRLTALNAF VSQTLTROAEVRASROLAKDKV 960 
I s 
AP 
NECVRSQOSQRFGFCGNGTHLFSLANAAPNGMIFFHTVLLPTAYETVTAWSGICALDVDRTFGLVVKDVOLTLFRNLDDKF 1040 
P SG 
A FI 
YLTPRTMYQPRVATSSDF VQIEGCDVLFVNTTVSDLPSIIPDYIDINOTVOQDILENFRPNWTVPELTLDVFNATYLNLTG 1120 
A Bot 
L 
EIDDLEFRSEKLHNTTVELAILIDNINNTIVVNLEWLNRIETYVKWPWYVWLLIGLVVIFC IEIPLLLFCCCSTGCCGCIGCL 1200 
L 
c 
GSCCHSIFSRROFENYEP IEKVHVH : 1225 
c 


Fig. 5. Alignment of the S protein sequences of PRCV and two TGEV strains. The position of the segment omitted in the PRCV 
sequence is shown. Amino acid changes relative to Miller and to Purdue strains are indicated above and beneath the PRCV sequence, 
respectively. The asterisks mark two amino acids absent in the Purdue S protein. The location of the major TGEV antigenic sites C 
(open box) and A and B (filled box) is given (Delmas et al., 1990). 
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Table 1. Pairwise sequence homology between PRCV and 
TGEV polypeptides 


TGEV strain 
PRCV predicted polypeptides 


Purdue Miller FS772/70 
Number of ———— 
Designation amino acids aa* %+t aa % aa % 
s 1225 44 97 34. (97:30—~CO- = 
M 262 13. 95 ~j - li 95-8 
N 382 14 963 - - 9 976 
NS3-1 244 78 97-1 4 98-4 3 98- 
NS4 82 3 96-3 0 100 4 95-1 
NS7 78 7 91 - - 4 94-8 


* Number of divergent amino acids (aa). 

+ Percentage homology. 

t Sequence data not available. 

§ Compared with full-length NS3b ORF (Kapke et al., 1988). 


translated ORFs revealed a greater overall homology 
between PRCV and Miller or FS772/70 strains than 
between PRCV and the Purdue strain (Table 1). A 
further point to mention with respect to PRCV/TGEV 
sequence comparison is that the TGEV strains for which 
sequence data are available were isolated a long time 
before the PRCV outbreak. Therefore, one cannot 
exclude the possibility that the observed divergence in 
terms of point mutations reflects, at least in part, the 
natural drift of TGEV. 

No conclusion can be reached about the genome 
modification(s) controlling the phenotypic difference 
observed between PRCV and TGEV, namely the 
impaired multiplication of the former in the digestive 
tract. It is tempting to speculate that the truncation of the 
S gene is relevant as it is well known that the spike 
glycoproteins play a pivotal role in conferring tissue 
specificity to enveloped RNA viruses. In this respect, 
mutants of MHV encoding a point-mutated or a 
truncated S protein have been shown to be neuroatten- 
uated for mice, thus indicating that at least one virulence 
determinant is associated with this protein (Dalziel et al., 
1986; Fleming et a/., 1986; Gallagher et al., 1990). The 
omission of protein (about 30K) in the proximal region of 
PRCV S might affect, directly or by induced distant 
conformational change, a putative domain involved in 
the recognition of TGEV receptors on the enterocytes. 
Studies aiming to examine the pathogenicity for piglets 
of TGEV mutants in the region unique to the TGEV S 
gene are currently being developed as an approach to 
answer this question. In any case, the recognition that 
TGEV and MHV S proteins can sustain a large deletion 
each in a different region should help attempts to dissect 
the molecule functionally. 

To conclude, the present findings about PRCV 
together with those reported on MHV allow the 


proposition that genomic deletion as well as recombina- 
tion might be a major source of diversity among the 
coronavirus family. The studies on PRCV provide the 
first indication that a deleted coronavirus may emerge 
not only as the result of experimental selection but also 
spontaneously in the field and, furthermore, may have 
great epizootiological impact. 
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